Corpus: dan_news_2019_300K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 92 97 99 99 99
1000 818 946 985 987 990
10000 5800 8585 9697 9870 9931
100000 29466 64922 88470 96200 98612
1000000 62841 164509 247984 281191 292929


Zipf's diagram for sentence endings


Gnuplot diagram

16979 msec needed at 2024-08-23 14:12